Deep learning-based noise robust flexible piezoelectric acoustic sensors for speech processing
نویسندگان
چکیده
Flexible piezoelectric acoustic sensors (f-PAS) have attracted significant attention as a promising component for voice user interfaces (VUI) in the era of artificial intelligence things (AIoT). The signal distortion issue highly sensitive biomimetic f-PAS is one most challenging obstacle real-life application, due to fundamental difference compared with conventional microphones. Here, noise-robust flexible sensor (NPAS) demonstrated by designing multi-resonant bands outside noise dominant frequency range. Broad coverage up 8 kHz achieved adopting an advanced membrane (Nb-doped PZT; PNZT) optimized polymer ratio. Deep learning-based speech processing multi-channel NPAS show outstanding improvement speaker recognition and enhancement commercial microphone. Finally, filtered crowd condition noises, showing independent speaker’s speeches can be identified digitalized simultaneously. To fabricate (NPAS), are designed range, via resonance mechanism. material dimensional effect analysis. • We was Nb-doped PZT membrane. our showed
منابع مشابه
Flexible, Robust, and Efficient Human Speech Processing
Present-day speech technology systems try to perform equally well or preferably even better than humans under specific conditions. For more complex tasks machines frequently show degraded performance, because their flexibility, robustness and efficiency is lower than that of humans. In order to better understand the system limitations and perhaps further improve system performance, one can try ...
متن کاملA Statistical Model-Based Speech Enhancement Using Acoustic Noise Classification for Robust Speech Communication
In this paper, we present a speech enhancement technique based on the ambient noise classification that incorporates the Gaussian mixture model (GMM). The principal parameters of the statistical modelbased speech enhancement algorithm such as the weighting parameter in the decision-directed (DD) method and the long-term smoothing parameter of the noise estimation, are set according to the class...
متن کاملImproved Example-Based Speech Enhancement by Using Deep Neural Network Acoustic Model for Noise Robust Example Search
Example-based speech enhancement is a promising singlechannel approach for coping with highly nonstationary noise. Given a noisy speech input, it first searches in a noisy speech corpus for the noisy speech examples that best match the input. Then, it concatenates the clean speech examples that are paired with the matched noisy examples to obtain an estimate of the underlying clean speech compo...
متن کاملFactored Deep Convolutional Neural Networks for Noise Robust Speech Recognition
In this paper, we present a framework of a factored deep convolutional neural network (CNN) learning for noise robust automatic speech recognition (ASR). Deep CNN architecture, which has attracted great attention in various research areas, has also been successfully applied to ASR. However, to ensure noise robustness, since merely introducing deep CNN architecture into the acoustic modeling of ...
متن کاملRobust Features in Deep-Learning-Based Speech Recognition
Recent progress in deep learning has revolutionized speech recognition research, with Deep Neural Networks (DNNs) becoming the new state of the art for acoustic modeling. DNNs offer significantly lower speech recognition error rates compared to those provided by the previously used Gaussian Mixture Models (GMMs). Unfortunately, DNNs are data sensitive, and unseen data conditions can deteriorate...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Nano Energy
سال: 2022
ISSN: ['2211-3282', '2211-2855']
DOI: https://doi.org/10.1016/j.nanoen.2022.107610